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Abstract. We consider the online machine minimization problem in which jobs with 
hard deadlines arrive online over time at their release dates. The task is to determine 
a feasible schedule on a minimum number of machines. Our main result is a general 
Cl(m^ log m)-competitive algorithm for the preemptive online problem, where m is 
the optimal number of machines used in an offline solution. This is the first improve¬ 
ment on an (!?(log(pniax/Pmin))-competitive algorithm by Phillips et al. (STOC 1997), 
which was to date the best known algorithm even when m = 2. Our algorithm is 0(1)- 
competitive for any m that is bounded by a constant. To develop the algorithm, we in¬ 
vestigate two complementary special cases of the problem, namely, laminar instances 
and agreeable instances, for which we provide an 0(log m)-competitive and an 0(1)- 
competitive algorithm, respectively. Our 0(l)-competitive algorithm for agreeable 
instances actually produces a non-preemptive schedule, which is of its own interest 
as there exists a strong lower bound of n, the number of jobs, for the general non- 
preemptive online machine minimization problem by Saha (FSTTGS 2013), which 
even holds for laminar instances. 


1 Introduction 

We consider the fundamental problem of minimizing the number of machines that is nec¬ 
essary for feasibly scheduling jobs with release dates and hard deadlines. We consider the 
online variant of this problem in which every job becomes known to the online algorithm 
only at its release date. We denote this problem as the online machine minimization prob¬ 
lem. We will show that we may restrict to the semi-online problem variant in which the 
online algorithm is given the optimal number of machines, m, in advance. 

Our main result is a 0{m‘^ log mj-competitive algorithm for the preemptive online ma¬ 
chine minimization problem. This is the first improvement upon a O(log )-competitive 
algorithm m- Our competitive ratio depends only on the optimum value, m, instead of 
other input parameters. In particular, it is constant if the optimum is bounded by a fixed 
constant. Such a result was not known even for m = 2. 

Previous results. The preemptive semi-online machine minimization problem, in which 
the optimal number of machines is known in advance, has been investigated extensively by 
Phillips et al. and there have hardly been any improvements since then. Phillips et al. 
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show a general lower bound of | and leave a huge gap to the upper bound 0(log on the 
competitive ratio for the so-called Least Laxity First (LLF) Algorithm. Not so surprisingly, 
they also rule out that the Earliest Deadline First (EDF) Algorithm may improve on the 
performance of LLF; indeed they show a lower bound of ). It is a wide open question 

if preemptive semi-online machine minimization admits a constant competitive ratio or 
even a competitive ratio independent of the number of jobs or processing times, e.g., an 
/(mj-competitive algorithm for some function /. It is not even known whether a constant 
competitive algorithm exists for m = 2. 

The non-preemptive problem is considerably harder than the preemptive problem. If the 
set of jobs arrives online over time, then no algorithm can achieve a competitive ratio sublin- 
ear in the number of jobs |18] . However, relevant special cases admit online algorithms with 
small constant worst-case guarantees. The problem with unit processing times was studied 
in a series of papers [9l ll4l[T5lll81ll9j and implicitly in the context of energy minimization 
in [4]. It has been shown that an optimal online algorithm has the exact competitive ra¬ 
tio e ~ 2.72 mi- For non-preemptive scheduling of jobs with equal deadlines, an upper 
bound of 16 is given in [5]. We are not aware of any previous work on online machine min¬ 
imization restricted to agreeable instances. However, in other contexts, e.g., online buffer 
management m and scheduling with power management mm, it has been studied as an 
important and relevant class of instances. 

In a closely related problem variant, an online algorithm is given extra speed to the 
given number of machines instead of additional unit-speed machines. The goal is to find 
an algorithm that requires the minimum extra speed. This problem seems much better 
understood and speedup factors below 2 are known (see [ami Hz]). However, the power 
of speed is much stronger than that of additional machines since it can be viewed to allow 
parallel processing of jobs to some extent. None of the algorithms that are known to perform 
well for the speed-problem, e.g., EDF and LLF, are /(mj-competitive for any function / for 
the machine minimization problem. 

We also mention that the offline problem, in which all jobs are known in advance, can be 
solved optimally in polynomial time if job preemption is allowed Again, the problem 
complexity increases drastically if preemption is not allowed. In fact, the problem of deciding 
whether one machine suffices to schedule all the jobs non-preemptively is strongly NP- 
complete [10] . It is even open if a constant-factor approximation exists; a lower bound of 2—e 
was given in [5]. The currently best known non-preemptive algorithm has an approximation 
factor of 0{,y (log n)/(log log n)) [7]. Small constant factors were obtained for special cases [5J 
[20] . However, when slightly increasing the speed of the machines, then also the general 
problem can be approximated within a factor 2 [12] . 

Our Contribution. Our main contribution is a new preemptive online algorithm with a 
competitive ratio 0(m^ logm), where m is the optimum number of machines. It is the first 
improvement upon the O(log ^sia^)-competitive algorithm by Phillips et al. [17]. Specifically, 
if the optimum value m is bounded by a constant, our algorithm is (!I(l)-competitive. 

We achieve this algorithm by firstly studying two complementary special cases of the 
problem, namely, laminar instances and agreeable instances. In a laminar instance, if the 
feasible time intervals for processing (between release date and deadline) of any two jobs 
overlap, then one interval is completely contained in the other. In an agreeable instance, 
however, the relative order of release dates coincides with that of the corresponding dead¬ 
line. We provide an 0(log m)-competitive algorithm for laminar instances and an 0(1)- 
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competitive algorithm for agreeable instances. Then we combine both techniques to derive 
an 0{rn? logm)-competitive algorithm for the general problem. 

It is not difficult to see that we may assume that the optimum number of machines m 
is known, i.e, the semi-online model, and that jobs with a small processing time relative to 
the entire time window (“loose” jobs) are easy to schedule. For agreeable instances we show 
that when scheduling “tight” jobs simply in the middle of their time windows, then there 
are at most 0{m) jobs running at the same time. Our (!I(l)-competitive algorithm actually 
produces a non-preemptive schedule. This result is of its own interest as there exists a 
strong lower bound of n for the general non-preemptive (semi)-online machine minimization 
problem [18], which even holds for the special case of laminar instances. 

The most difficult special case seems to be the scheduling of “tight” laminar jobs. Here 
we separate the job-to-machine assignment from the scheduling procedure. Upon arrival, 
we assign a job irrevocably to a single machine and simply run EDF on each machine 
individually. Thus we restrict ourselves to non-migratory schedules, which allows us to keep 
a better control over the remaining processing capacity in a time interval. To assign a job to 
a machine we consider on each machine the previously assigned smallest job that is relevant 
at this point in time, and we find one whose laxity is large enough to fully cover the new job’s 
entire time window. We then assign the new job to the same machine. Fitting a job’s full 
time window (instead of only the processing volume) into the laxity of a previously assigned 
job may seem rather restrictive. But jobs are “tight”, i.e., they contribute a significant 
processing volume to the interval, and we gain much more structure for the analysis. A 
greedy assignment turns out to fill up laxities too aggressively. To slow down this process 
we employ a more sophisticated balancing scheme in which the laxity of a job is divided 
into evenly-sized bins and jobs are distributed carefully over these bins. Our analysis shows 
that the algorithm is O(log m)-competitive. 

For the general setting it is tempting to use the previous algorithms as a black box. 
However, it is unclear if an online partitioning into a small number of agreeable and laminar 
sub-instances is possible. Instead, we propose a more sophisticated variant of the assignment 
procedure for the laminar case, losing an additional factor of m. Instead of assigning a new 
job to a single machine, we form 0(wf logm) groups of jobs. Each of these groups is then 
scheduled on 0{m) machines by generalizing the idea from the agreeable setting. 

Outline. In SectionjS] we define the problem and give first structural insights to the problem. 
We give an (!I(l)-competitive algorithm for agreeable instances in Section[3]and an Oilogm)- 
competitive algorithm for laminar instances in Section|4| In Section[5l we show how to extend 
the techniques for the special cases to an Oirnf log mj-competitive algorithm for the general 
problem. 


2 Preliminaries 

Problem Definition. Given is a set of jobs J = {1, 2, ..., n} where each job j € J has a 
processing time pj G N, a release date rj G N which is the earliest possible time at which 
the job can be processed, and a deadline dj G N by which it must be completed. The task 
is to open a minimum number of machines such that there is a feasible schedule in which 
no job misses its deadline. In a feasible schedule each job j G J is scheduled for pj units 
of time within the time window [rj,dj). Each opened machine can process at most one job 
at any time, and no job is running on multiple machines at the same time. Unless stated 
differently, we allow job preemption, that is, a job can be preempted at any moment in time 
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and may resume processing later on the same or any other machine. When preemption is 
not allowed, then a job must run until completion once it has started. 

To evaluate the performance of our online algorithms, we perform a competitive analysis 
(see e.g. m)- We call an online algorithm A c-competitive if toa machines with ruA < c ■ m 
suffice to guarantee a feasible solution for any instance that admits a feasible schedule on m 
machines. 

Notation. For any job j G J, the laxity is defined as £j = dj —rj —pj- We call a job a-loose, 
for some a < 1, if pj < a{dj — Vj) and a-tight otherwise. The (processing) interval of j is 
I[j) = [rj,dj). For a set of jobs S', we define I{S) = Ujgs/(j). For / = where 

[oi, 5i),..., [ofe, bk) are pairwise disjoint, we define the length of I to be |J| = ~ “*)■ 

Characterization of the Optimum. For I as above, the contribution of a job j to / 
is C[j,T) := max{0, \I fl I{j)\ — £j}, that is, the minimum processing time that j must 
receive in I in any feasible schedule. The contribution of a job set S to / is the sum of the 
individual contributions of jobs in S, and we denote it by C{S,I). Clearly, if S admits a 
feasible schedule on m machines, (^(S, J)/|/| must not exceed to. Interestingly, this bound 
is tight, which we prove in the appendix. 

Theorem 1. Let to be the minimum number of machines needed to schedule instance J 
feasibly. Then there exists a union of intervals I with |"C'(J,/)/|/|] = to but none with 

rc(j,/)/|/n >TO. 

Reduction to the Semi-Online Problem. We show that we may assume that the op¬ 
timum number of machines to is known in advance by losing at most a factor 4 in the 
competitive ratio. To do so, we employ the general idea of doubling an unknown parame¬ 
ter [6]. More specifically, we open additional machines whenever the optimum solution has 
doubled. 

Let Ap(rn) denote a p-competitive algorithm for the semi-online machine minimization 
problem given the optimum number of machines to. Further, denote by m(t) the minimum 
number of machines needed to feasibly schedule all jobs released up to time t. Then our 
algorithm for the online problem is as follows. 

Algorithm Double: 

— Let to = min^-gj rj. For t = 1, 2,... let ti = min{t | m{t) > 2m{ti-i)}. 

— At any time ti, t = 0,1,..., open 2pm{ti) additional machines. All jobs with Vj G [ti-i, U) 
are scheduled by Algorithm Ap{2m(ti-i)) on the machines opened at time ti_i. 

Since the time points to, ti,... as well as m{to), TO(ti),... can be computed online and Ap 
is assumed to be an algorithm for the semi-online problem, this procedure can be executed 
online. Notice that Double does not preempt jobs which would not have been preempted by 
Algorithm Ap. 

Theorem 2. Given a p-competitive algorithm for (non-)preemptive semi-online machine 
minimization, Double is Ap-competitive for (non-)preemptive online machine minimization. 
In the rest of the paper we will thus be concerned with the semi-online problem. 

Scheduling Loose Jobs. We show that, for any hxed a < 1, a-loose jobs are easy to 
handle via a simple greedy algorithm called Earliest Deadline First (EDF) that schedules 
at any time m' = pm unfinished jobs with the smallest deadline. If every job is feasibly 
scheduled, it is p-competitive. 
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Theorem 3. Let a € (0,1). EDF is 1/(1 — a)^-competitive for any instance that consists 
only of a-loose jobs. 

As we aim for asymptotical competitive ratios in this paper, we can from now on assume 
that all jobs are a-tight for a fixed a G (0,1). 

3 A Constant-Competitive Ratio for Agreeable Instances 

Consider an instance J in which any two jobs j,j' G J are agreeable, that is, rj < rp 
implies dj < dj/. We also call J agreeable. For such instances, we derive an 0(1 (-competitive 
online algorithm. By Theorem [5J we may assume that the optimum value m is known in 
advance. Using Theorem^ we can choose some constant a G (0,1) and schedule a-loose jobs 
by EDF on a separate set of machines. It remains to show an 0(l)-competitive algorithm 
for a-tight jobs. 

We define the ^-interval of a job j, for some /3 < 1/2, as Ip{j) ■= [vj -F f3ij,dj — fdij). 
We use the following simple non-preemptive algorithm. 

Algorithm MediumFit: Each job j is scheduled in the middle of its feasible time window, 
i.e., during /i/ 2 (j). If for some job j there is no vacant machine during /i/ 2 (j), then we open 
a new machine. 

For the analysis of MediumFit, let two jobs j,j' be jd-agreeable if they are agreeable and 
their /3-intervals have a non-empty intersection. We give an upper bound on the number of 
jobs that can be /3-agreeable with a single job. In fact, we will prove a stronger result than 
actually needed for this section, but it will be useful for analyzing the general algorithm in 
Section O 

Lemma 1. Consider a job j and some fixed a G (0,1) and /3 e (0,1/2]. Then there exist at 
most 0{m) a-tight jobs j' with ly < ij such that j and j' are fd-agreeable. 

Proof. Consider all such j'. We estimate their contributions to the interval(s) 

I = [rj — 2£j , rj -F 2£j ) U [dj — 21 j , dj -F 2£j ), 

which has a total length of at most 8£j. There are two possibilities for j'. 

Case 1: We have ]/(/')! ^ I{j') contains rj or dj, we have |/n/(j')| > 2£j. Given 

that £j/ < £j, j' contributes at least 2£j — £ji > £j to I. 

Case 2: We have \l\j')\ < 2£j. As a consequence I{j') C I. Observe that |/(/)| > ld£j 
holds because j and j' are /3-agreeable. As j' is a-tight, its contribution to I is 
at least al3£j. 

Let ni and n 2 be the number of jobs corresponding to the above two cases, respectively. 
Then the contribution of the ni -F n 2 jobs to I is at least 


(ni -F a/3n2) • £j > (ni -F ^ 2 ) • al3£j. 

Using Theorem [U the total contribution is upper bounded by m\I\ < 8m£j, implying ni -F 
712 < Smfafi = 0{m). □ 
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Now, consider the MediumFit schedule. Any two jobs that are processed at the same 
time are /3-agreeable for jS = 1/2. By Lemma [U the number of such jobs—and, thus, the 
number of required machines—is 0{m). Therefore, we have an O(l)-competitive algorithm 
for agreeable instances. 

Notice also that our final schedule is non-preemptive. MediumFit (for a-tight jobs) is 
by definition a non-preemptive algorithm. EDF (for a-loose jobs) on agreeable instances 
never preempts jobs that have already started because the jobs released later have a larger 
deadline. Since we compete with a weaker optimum in the non-preemptive setting, our result 
carries over to this setting. 

Theorem 4. On agreeable instances, there is an 0(1)-competitive algorithm for preemptive 
and non-preemptive online machine minimization. 


4 0(log Tn)-Competitiveness for Laminar Instances 

In this section, we consider laminar instances in which for any two jobs j, j' with overlapping 
intervals, that is, |/(j) n/(j')| > 0, holds that either I{j) C /(/') or /(/) A I{j). We prove 
the following result. 

Theorem 5. On laminar instances, there is an O{log m)-competitive algorithm for online 
machine minimization. 

We assume that jobs are indexed from 1 to n in accordance with their release dates, that 
is, j < f implies rj < rji. If rj = rji, we assume that j < j' implies dj > dp. We say that j 
dominates j' (denoted as j >- j') if j < j' and I{j) A I{j'). We also say j' is dominated by 
j and denote this as j' -< j. 

4.1 Description of the Algorithm 

By Theorems [2] and [3l it again suffices to consider semi-online scheduling of a-tight jobs for 
some fixed a G (0,1). 

Job Assignment. We open m' machines and will later show that we can choose to' = 
O(TOlogTO). At every release date, we immediately assign a new job to its machine and we 
never revoke this decision. Thus, we will obtain a non-migratory schedule. 

The assignment procedure is as follows. We consider an unassigned new job j. If there 
is a machine that is “surely free” at time rj, that is, there is no job j' whose time window 
I{j') contains rj, then we assign j to this machine. Otherwise we do the following. 

We consider all previously assigned jobs j' with rj G I{j')- Since the instance is laminar, 
there is on each machine a unique -<-minimalioh among them (unless intervals have identical 
time intervals, in which case we pick the one with the smallest index), that is, a job that 
is not dominating any other job on this machine. Essentially, this is the job that has the 
smallest interval length I{j') on a machine. We call each such job candidate. Now, consider 
the laminar chain Ci{j) -< ■■■ -< Cm>{j) of all candidates at time rj. Again, this chain 
exists because the instance is laminar. We call Ci{j) the Tth candidate of j. Notice that 
I{j) O I{ci{j)) for all i, that is, j is dominated by all its candidates. 

We want to assign j to one of the machines and need to ensure that it can feasibly 
be scheduled. To that end, we will find a candidate Ci{j) that has enough capacity (i.e., 
laxity) in its time window to fit the full time interval I{j) and assign j to this candidate 
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Ci{j). Clearly, we have to carefully keep track of the contribution of previously assigned jobs 
to Ci(j) which reduces the remaining capacity (i.e., the laxity) within the candidate’s time 
window. However, a greedy assignment of jobs to their -<-minimal candidates fails, and we 
employ a more sophisticated balancing scheme. 

We partition the laxity of each candidate’s time window into m' equally-sized bins. To 
assign j, we select the smallest i such that the i-th bin of the I’th candidate Ci(j) still has 
a remaining capacity of at least |/(j)|. If there is no such i then the assignment of j fails. 

To make this assignment more precise and to enable us to analyze the procedure later, we 
introduce some notation. We denote a job j that has been assigned to its Tth candidate Ci{j) 
as the i-th user of Ci{j). We denote the set of all Tth users of a job j' by Ui{j'), for any i. 
All i-th users of j' will contribute to the Tth bin of j'. As stated earlier, to assign j, we 
select the smallest i such that the i-th bin of Ci(j) still has a remaining capacity of at least 
|/(j)|, that is, 

|/(C/.(c.(j)))U/(j)|<%^. (1) 

m' 

If we find such an i and assign j, then the capacity of the f-th bin of Ci{j) reduces by |/(j)|. 
Otherwise, the assignment of j fails. 

Scheduling. Given a job-to-machine assignment, we run EDF on each machine separately. 


4.2 Analysis of the Algorithm 

We first show that our algorithm obtains a feasible schedule if no job assignment has failed. 
Then we give a proof of the fact that the job assignment never fails on instances that admit 
a feasible schedule. 

Lemma 2. If the job assignment does not fail, our algorithm schedules feasibly. 

Proof. Consider some job j and note that, whenever it is preempted at some time t, there 
must be a user j' of j with t G /(/)• According to Inequality ([ij, j is thus preempted for 
no longer than 

m' 

Therefore, it can be processed for pj time units. □ 

It remains to show that the job assignment never fails for some m' = 0{m log m). The proof 
idea is as follows: We assume that the algorithm fails to assign a job. We select a critical job 
set, the failure set, which is the set of jobs that contribute (directly or indirectly) to the m'- 
th bin of the m'-th candidate of the failing job. We take into account the direct contribution 
by users of that bin and the indirect contribution by jobs that filled lower-indexed bins of 
lower-indexed candidates and thus indirectly contributed to the filling of the last possible 
bin. With this failure set we derive a contradiction using a load argument. 

Let j* be a job whose assignment fails. We iteratively define its failure set F C J as 
the union of the m' -|- 1 different sets Fq, ... ,Fm'- To define the base set Fq, we also use 
auxiliary sets Fq, ..., F™ . 

We initialize FJ" := {j*}. Given Fg, we construct Fi and Fq“^ in the following way. 
First, Fi is defined to be the set of all -<-maximal Fth candidates of jobs in Fg. Subsequently, 
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Fq ^ is constructed by incrementing the set of users in Fq by the I’th users of jobs in Fj. 
Formally, 


F, := ({c.(j) I j e Fo*}) 

and Fo*-i := F* U (J Ui{j), 

jeFi 

where is the operator that chooses the -<-maximal elements from a set of jobs: (S) := 

{j G F I $j' : j ^ j'}. After m' such iterations, that is, when Fq , Fi,..., F^' have been 
computed, we set Fq := M^{Fq). Our failure set is Fq U Fi U ... U F^/. 

We show some properties of failure sets which will be useful for the proofs later. To that 
end, we define the following notation. For two job sets Si and S' 2 , we write -< S 2 if, for 
each ji G Si, there is a j 2 G S '2 with ji -< j 2 - Moreover, a job set S is said to j-block a job 
j if there is a subset S' C S with S' -< {j} and |/(S')| > j£j. Finally, Si ^-blocks S 2 if it 
7 -blocks every job in S 2 . 

Lemma 3. Every failure set F has the following properties. 

(i) We have Fq ^ ^ F^'- 

(ii) The sets Fq, ..., Fm' are pairwise disjoint. 

(Hi) For all i = 1,... ,m', Fq ^^-bloeks Fi. 

Proof. To see (i), we define Ci := {ci{j) \ j G Fq}, for every 1 < i < m', and Co ■= Fq. 
We first show Ci-i -< Ci for every 1 < z < m'. For i = 1, this directly follows from the 
construction. Consider 2 < i < m!. Let j be an arbitrary job in Ci-i, which is then an 
(z — l)-th candidate of some job in Fq“^, say, job j'. According to the construction, we have 
j' G F}, or j' is the z-th user of some job in Ci. In both cases, Ci contains the z-th candidate 
of job j', which dominates job j. Hence, Ci-i -< Ci. Since Fi_i and Fi are obtained from 
Ci-i and Ci only by deleting dominated jobs, Fi_i ^ Fi follows. 

Consider property (ii) and suppose there is some j G Fj fl F^/ for some i < i'. Then, by 
(i), there is also a job j' G Fi' with j -< j', contradicting the fact that Fi' only contains 
^-maximal elements. 

Consider property (iii). Let 1 < z < m' and j' G Fj be an arbitrary job. We show that 
there is some Fq C Fq with Fq ^ {/} and |/(Fq)| > ip jm'. By the construction of Fj, we 
have j' = Ci(j) for some j G F}. Recall the construction of Fq: We have j = j*, or j is 
the z'-th user of some job where z' > z -|- 1. In both cases, as our algorithm assigns j to the 
machine of the smallest possible candidate, j could not be assigned to the same machine as 
j', which is its z-th candidate. Thus, when our algorithm assigned job j, the capacity of the 
z-th bin of j' did not have enough remaining capacity for j to become its user. Hence, 

Note that Ui{j') U {j} C Fq holds and that Ui{j') U {j} consists of pairwise disjoint jobs. 
As Fq is obtained by only deleting jobs in Fn that are dominated by others, we can always 
find F^ C Fq with |/(F^)| > ip/m'. □ 

Using a load argument, we can even derive stronger blocking relations. 

Lemma 4. For any fixed a G (0,1), there exists k = 0{m) with the following property. If 
Fi ^-bloeks Fh, then Fi+k 2j-blocks Fh, for all h > i + k. 



Proof. Consider some arbitrary j € Fh- For every i' G {i,... ,2 + k}, we denote by F*, the 
subset of Fi! consisting of jobs that are dominated by j. We prove that, for k = 0{m) the 
inequality > 2|/(F*)| holds. By Lemma[3](i) and (ii), we have the chain I{F*) C 

^ m+k)- As all of these jobs are a-tight, we can thus lower bound the processing¬ 
time contribution of Ff, to I{Ff_^^) by a\I{F*,)\ > a\I{F*)\, for all i' G + k}. 

The contributions of all these jobs sum up to at least ka\I{F*)\, which must not exceed 
w|/(F*|_j,)| by Theorem [T] Hence, for k > ^mja = 0{m) we have |d(F’j^^.)| > 2|/(A*)|, 
which implies the claim. □ 

Lemma 5. There exists m' = 0{m\ogm) such that the assignment never fails. 

Proof. We will choose m! = 0{m log m) such that we get a contradiction to the existence of 
the failure set F. Using Lemma |3] (hi) and Lemma SI we can choose A = O(mlog77i') such 
that F\ 2-blocks every Fi with i > X. Now we lower bound the total contribution of these 
Fi to I{F\): For any j G Fi where i > X, its contribution to I{F\) is 


\m)niU)\-i,>\iiFx)niij)\/2 


since |/(F'a) C /(j)| > 2£j. Thus we get as total contribution of Fi to F\ at least 


E 

jeFi 


\i{Fx)ri{j)\ ^ 1 
2 - 2 


/(Fa) n ( IJ I{j)) 

jeFk 


|/(F,)n/(F0| ^ \IiFx)\ 


where the last inequality follows from F\ -< Fi. Therefore, the total contribution of all Fi 
for i > A is at least (m' — A)|/(Fa)|/ 2, which is at most m|/(FA)| by TheoremS] Thus, we 
must have m' < A-I-2m = O{m\ogm'), and appropriately choosing m' = O{mlogm) yields 
a contradiction. □ 


As stated earlier, combining this lemma with Lemma [2] shows Theorem [5l 


5 log m)-Competitiveness in the General Case 

Now, we generalize the methods introduced above and prove our main result. 

Theorem 6. There is an 0{mf log m)-competitive algorithm for online machine minimiza¬ 
tion. 

It is not clear if an online separation into a reasonable number of laminar and agreeable 
sub-instances exists such that we could utilize the previous algorithms as black boxes. Nev¬ 
ertheless, we take over the notion of domination from the laminar case and slightly extend it 
using the previously introduced definition of ^-intervals hij) ■= [vj -f 6£j, dj — S£j). We say 
job j 5-dominates job j' if job j dominates j' and we have I{j') fl Is{j) ^ 0. The presented 
algorithm works for any fixed 6 G (0,1/2). 


5.1 Description of the Algorithm 

Again, by Theorems H] and [31 it suffices to consider semi-online scheduling of a-tight jobs 
where a is fixed. We describe an 0(m^ logm)-competitive algorithm. In its structure, our 
algorithm mainly resembles the previous algorithm for laminar instances and it incorporates 
ideas from the one for agreeable instances. 
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Job Assignment. Instead of assigning each job immediately to a fixed machines, we form 
g initially empty groups of jobs. We will later choose g = O(m^logm) and show that 
each of these groups can be scheduled on 0{m) machines. Upon its release, each job j is 
immediately assigned to one of the groups based on the assignment of the previously released 
(5-dominating jobs. If there exists a group in which no job ^-dominates j, then j is assigned to 
such a group. Otherwise, in contrast to the laminar case, there might be multiple ^-minimal 
such jobs in each group. Among these jobs we now pick the one with the minimum laxity 
from each group, and we denote this set of g jobs as D. 

Like in the laminar case, we choose r candidates, where t is independent of j and will be 
specified later. Again, these jobs will form a chain ci(j) -< • • • ^ Cr{j), where we call Ci{j) the 
i-th candidate of j, for all i. Recall that this step is straightforward for a laminar instance 
as the jobs in D form such a chain already. Here, however, this chain is constructed in an 
iterative way, starting with iteration 0: In iteration z, we define Cr-i(j) to be a maximum- 
laxity job among those jobs in D dominated by all jobs selected so far. If no such jobs exist 
any more before we have found the r-th candidate, the job assignment fails. 

Analogous to the laminar case, we pick a candidate Ci{j) and assign j to the same group. 
As before, j will be called an z-th user of Ci{j) after this assignment, and the set of h-th 
users of some job j' is denoted by Uh{j'), for all h. We open r equally-sized bins for each 
job, where we again pack intervals of its h-th users into the h-th bin, for all h. In contrast 
to the laminar case, the sum of the bin sizes of a job is only a small fraction of its laxity: 
Here, we choose the smallest candidate Ci{j) such that 

|/(C/,(q(j)))U/(j)|<^, 

qmr 

where <7 is a constant yet to be determined. The job assignment fails if we do not find such 
a candidate. Note that, again in contrast to the laminar case, users of the same job may 
overlap, that is, \I{Ui{ci{j))) U /(j)| is only a lower bound on the sum of the individual 
interval lengths of jobs in Ui{ci{j)) U {j}. 

Scheduling. Consider a group and let S' C J be the jobs assigned to it. We schedule S 
on 0{m) machines in the following way. Similarly to our algorithm for agreeable instances, 
each job j S S is scheduled only within h(j). More precisely, at any time t consider all 
the unfinished jobs j £ S with t £ Isij)- Among all these jobs, we schedule exactly the ^- 
minimal jobs, each one on a separate machine. Note that our algorithm for laminar instances 
also schedules exactly the ^-minimal jobs on each machine, which coincides with EDF for 
laminar instances. 


5.2 Analysis of the Algorithm 

As in the laminar case, we first show that, if the job assignment never fails, we obtain a 
feasible schedule for all jobs. We need the following auxiliary lemma. 

Lemma 6. Let j and j' be two jobs that do not dominate one another but both 5-dominate 
some job j*. Then j and j' are {5/2)-agreeable. 

Proof. We show that 

£nU). ( 2 ) 

If t* £ IsU) holds. Equation ([2]) directly follows. Otherwise, let I be the largest interval 
that contains P and is a subset of I{j*) \ IsU)- Note that either rj* borders I to the left or 
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dj* borders I to the right. W.l.o.g. assume that I = [t, dj* ) for some t > Vj*. Now we have 
t* — t < \I\/2 < 5ijl2. As Is/ 2 {j) is obtained from Is{j) by enlarging it by S£j/2 in both 
directions, Equation @ follows. 

By the same argument, we can prove t* S Is/ 2 ij')y the claim follows. □ 

Now we can prove the following statement, where we will pick a suitable q. 

Lemma 7. If the job assignment does not fail, our algorithm produces a feasible schedule 
on 0{mg) machines. 

Proof. Consider an arbitrary job j, assigned to group h. We first prove that j receives a 
processing time of pj, that is, it is preempted for at most (1 — 26)£j within Isij). Consider 
an arbitrary time t G hU) when j is preempted, and let St be those jobs j' from group h 
with t G Isij')- According to the scheduling rule, j is not a ^-minimal job in St, so there 
exist jobs in St which are dominated, and thus ^-dominated, by j. Among them, let j* be a 
^-maximal one, and consider the time when job j* is assigned to group h. Then job j is one 
of the ^-minimal jobs that i5-dominate j* in group h. According to the algorithm, among 
all the ^-minimal jobs that ^-dominate j* in group h, the one of the least laxity is picked 
and added to the chain of r candidates of j*, and j* is eventually assigned (as a user) to 
this candidate. Thus, job j* is either a user of job j, or a user of some other job j' with 
£j' S: £j which is also a -<-minimal job that (5-dominates j*. 

With the above argument, we conclude that job j is preempted either due to one of 
its own users or a user of some other job j' from group h such that £j' < £j. By our job 
assignment, this preempts j by at most £j/iqm) or £jifiqm) < £j/iqm), respectively. We 
claim that there are at most 0{m) such j'. To see why, notice that according to Lemma [5] 
jobs j and j' are 5/2-agreeable. As £j> < £j, according to Lemma [T] the claim follows. Thus 
in total job j is preempted by at most 0(m) ■£j j[qm) < (1 — 2S)£j, with q being a sufficiently 
large constant. 

It remains to show that, in the same group h, we process at most 0(m) jobs at any time 
t. This is straightforward: Let St be as above. Note that we always process the ^-minimal 
jobs in St, and any two of them are ^-agreeable. Hence, according to Lemma [TJ there are at 
most Oim) such jobs. □ 

We now show that the job assignment never fails. In contrast to the laminar case, we 
also have to make sure that it never fails at finding a chain of candidates. 

Lemma 8. We can choose t = f2{gfm), and our algorithm will never fail at finding a chain 
of T candidates. 

Proof. Let S be some set of jobs that all d-dominate the same job j*. Let job j be the one 
with maximum laxity in S. Note that is suffices to show that, except for Oim) jobs, all the 
jobs in S are dominated by j. Then, every time when we pick a new candidate, we lose Oim) 
jobs, and with some t = flig/m) the assignment procedure does not fail when selecting the 
chain of candidates. 

Consider any job j' G S which is not dominated by j. If job j' does not dominate j 
either, then due to the fact that jobs j and j' both 5-dominate j*, j' is ((5/2)-agreeable with 
j according to Lemma [51 By applying Lemma [H we get that there are at most Oim) such 
jobs j'. Otherwise j' dominates j. As we have £j' < £j and /(/) A /(j) for each such j', it 
follows that j' contributes at least pj to I if). Using that every job is a-tight, Theorem |T] 
now allows us to bound the number of these jobs by m/a = Oim), too. □ 
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The remainder of the proof again relies on selecting a failure set whenever a job as¬ 
signment fails and deriving a contradiction from that. The arguments are the same as in 
Sectional Thus, we omit them here and refer to the appendix. 
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A Missing Material from Section 

A.l Proof of Theorem [T] 

Proof (Theorem [Jj). It is easy to see that C{J,I) < m\I\ since the processing volume of 
C{J,I) must be hnished within / of length |J| in any feasible solution, while the optimal 
solution can finish at most m|/| by using m machines. 

It remains to show that there exists some I* such that C{J,I*)/\I*\ > m — \. Notice 
that the offline scheduling problem can be formulated as a flow problem, and there exists an 
optimal schedule which makes decisions only at integral points in time. Consider any such 
schedule that uses m machines. We denote by Xh the number of time slots [t,t + 1) during 
which exactly h machines are occupied. Thus, for any optimal schedule on m machines that 
makes decisions only at integral points in time, we obtain a vector y = (xm, • • ■ ,Xo)- We 
dehne a lexicographical order < on these vectors: we have X < if only if there exists 
some h with Xh < x'h Xi = XiJ for all i < h. We pick the optimal schedule whose 
corresponding vector is the smallest with respect to <. 

We now construct a directed graph G = (P, A) based on the schedule we have picked. 
Let V = {r^o,. •. where Vt represents the slot [t,t + 1). Let (ft be the number of 

machines occupied during [t,t + !)■ An arc from Vi to Vk exists iff (ft > (fk and there exists 
some job j with [i, i + 1) U [/c, fc + 1) C /(j) whereas j is processed in [i, i + 1) but not in 
[fc, k + 1). Intuitively, an arc from Vi to Vk implies that one unit of the workload in [i, z + 1) 
could be carried to [k,k-\- 1). 

We claim that in G there is no (directed) path which starts from some Vi with (ft = m, 
and ends at some vg with (fg < m — 2. Suppose there exists such a path, say ... ,Vig.) 
with = m and (ft^ < m — 2. Then we alter the schedule such that we move one unit of 
the workload from [is,is + 1) to [zs+i,Zs+i + 1), for all s < £. By doing so, (fi^ decreased 
and (fii increased by 1 each. By = m and (fi^ < m — 2, we get that Xm decreases by 1, 
contradicting the choice of the schedule. 

Consider Vm = {vi \ (ft = m} and let P{Vm) = ■. • iZ’if} be the set of vertices 

reachable from Vm via a directed path (trivially, Vm C P(Vm))- The above arguments imply 
that for any Vi G P{Vm), it holds that (ft > m — 1. We claim that J2j C'(j, I*) = J2h for 
I* = {[ih,ih + 1 ) I 1 < h < .^}. Note there is no arc going out from P{Vm) (otherwise the 
endpoint of the arc is also reachable and would have been included in P{Vm))- That is, we 
cannot move out any workload from /*, meaning that the contribution of all jobs to J* is 
included in the current workload in I*, i.e., '^ih- Thus, 

c{j,n ^ 

|/*| - |/*| \P{Vm)[ 

Notice that (fi^^ is either m or to — 1, and among them there is at least one of value to, thus 
the right-hand side is strictly larger than to — 1, i.e., G{J,I*)/\I*\ > to — 1. □ 

We remark that this result can be proven also using LP duality. 

A.2 Proof of Theorem!^ 

Proof (Theorem Let to,ti,...,tk denote the times at which Double opens new ma¬ 
chines. Double schedules the jobs Ji = {j\rj G with i = 0,1,..., fc, using Al¬ 

gorithm Ap{2m{ti)) on 2pm{ti) machines exclusively. This yields a feasible schedule since 


13 





an optimal solution for Ji requires at most — 1) < 2m{ti) machines and the p- 

competitive subroutine Ap(2m{ti)) is guaranteed to find a feasible solution given a factor p 
times more machines than optimal, i.e., 2pm{ti). 

It remains to compare the number of machines opened by Double with the optimal 
number of machines m, which is at least m{tk). By construction it holds that 2m(ti) < 
which implies by recursive application that 


2m{ti) < 


i{tk) 


The total number of machines opened by Double is 

k k ^ k 


(3) 


J22pmiU) < ^ p-^nrTm(tk) = :^rn{tk) < 4pm 


i=0 


i=0 


i=0 


2 * 


which concludes the proof. 


□ 


A.3 Proof of Theorem [3] 

Recall that EDF is the algorithm that schedules m' = pm unfinished jobs with the smallest 
deadline at any time t. Specifically, if there are multiple jobs with the same deadline, we 
assume that EDF will break the tie arbitrarily. Notice that the release dates, deadlines, and 
processing times of all jobs are integers. Thus, if EDF decides to process a job at time t, it 
actually processes the job during [t, t +1), i.e., EDF does not change its decision at fractional 
time points. Therefore, throughout this subsection, we only consider algorithms that make 
decisions at integral time points and denote by A any such algorithm. 

We give some notations. We denote by wa( 0 the total processing volume of jobs that A 
assigns to [t, t+1) (or simply to t). Specifically, if algorithm A makes decisions only at integral 
time points, this is simply the number of assigned jobs. Further, we let p^{t) denote the 
remaining processing time of job j at time t, if{t) = dj — If it is clear from context, 

we may drop the superscript A and simply write pj (t) and ij (t). We let Wa (t) denote the 
total remaining processing time of all unfinished jobs (including those not yet released) at 
time t, i.e., WA(t) = ~ Ss=o ^a(s). We use OPT to indicate an optimal schedule. 

A job is called active if it is released but not finished. An algorithm for the (semi-)online 
machine minimization problem is called busy if at all times t, WAit) < m' implies that at 
time t there are exactly WA{t) active jobs, where m' is the number of machines used by the 
algorithm. 

We prove Theorem [3] by establishing a contradiction based on the workload that EDF 
when missing deadlines must assign to some interval. We do so by using the following work 
load inequality, which holds for arbitrary busy algorithms. Here, dmax denotes the maximum 
deadline that occurs in the considered instance. 

Lemma 9. Let every job be a-loose, p > 1/(1 — and let A be a busy algorithm for 
semi-online machine minimization using pm machines. Assume that (f{t') > 0 holds, for 
all t' <t and j. Then 

Ra (0 < W 0 PT{t) + -m • (dmax “ t) . 

I — a 

Proof. We prove by induction. The base is clear since Wa{^) = ITopt(O). Now assume the 
lemma holds for all t' < t. There are three possibilities. 
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Case 1: We have w/\{t) < a/{I — a) ■ m. By the fact that A is busy, there is an empty 
machine at time t, meaning that there are at most a/(l — a) ■ m active jobs. 
Given that > 0 for all t' < t, each of the active jobs has a remaining 

processing time of no more than dmax — C implying that W^it) is bounded by 
a/(l — a) • m • (dmax — t) plus the total processing time of the jobs that are not 
released. As the latter volume cannot exceed Wopt(0i f^e claim follows. 

Case 2: We have WA{t) > 1/(1 — a) -m. According to the induction hypothesis, it holds 
that WA(t-l) < WopT(t-l)+a/(l-a)-TO-(dmax-t + l). Using woPT(t) < m, 
we get Wopt(0 Wopj(t — 1) — m. For Algorithm A, it holds that WA{t) = 
WA{t — 1) — WA{t — 1) < Wi\{t — 1) — 1/(1 — a) • TO. Inserting the first inequality 
into the second one proves the claim. 

Case 3: We have a/(l — a) ■ m < wa(0 < 1/(1 — ct) ■ m. Again by the fact that A is 
busy, there is an empty machine at time t, i.e., there are less than 1/(1 — a) ■ m 
active jobs. Again distinguish two cases. 

Case 3a: We have Pj{t) < a{dj —t) for all active jobs. Then the total remaining processing 
time of active jobs is bounded by a(dmax — t) • 1/(1 — a) • to. Plugging in the total 
processing time of unreleased jobs, which is bounded by WopT(i), we again get 
the claim. 

Case 3b: There exists an active job j with Pj{t) > a{dj — t). Using that j is a-loose, i.e., 
Pj < a{dj — Tj), we get that j is not processed for at least (1 — a)(t — rj) time 
units in the interval [rj , t). As Algorithm A is busy, this means that all machines 
are occupied at these times, yielding 

Wa(0 < W^iTj) - (1 - a)(t - rj) ■ pm 

Q 

< IUopt(c) + ■ (<^max “ Tj) - (1 - a){t - rj) ■ pm 

i — a 

Q/ 

< IUopt(c) + ■ (^max - t) - m ■ {t - rj), 

where the second inequality follows by the induction hypothesis for t = rj, and 
the third one follows by p > 1/(1 — a)^. Lastly, the feasibility of the optimal 
schedule implies 

WopT{t) > Wopt(c) -1^- {t-'Tj), 

which in turn implies the claim by plugging it into the former inequality. □ 

We now apply this lemma in the proof of the theorem. 

Proof (Theorem\^. Let pm be the number of machines EDF uses and consider the schedule 
produced by EDF from an arbitrary instance J only consisting of a-loose jobs. We have to 
prove that every job is finished before its deadline if p = 1/(1 — a)^. To this end, suppose 
that EDF fails. Among the jobs that are missed, let j* be one of those with the earliest 
release date. 

First observe that we can transform J into J' C J such that j* G J', max^gj/ dj = dj* 
and EDF still fails on J'. For this purpose, simply define J' = {j G J \ dj < dj*} and notice 
that we have only removed jobs with a later deadline than maxjgj/ dj. This implies that 
every job from J' receives the same processing in the EDF schedule of J' as in the one of J. 

We can hence consider J' instead of J from now on. In the following, we establish a 
contradiction on the workload during the time interval [rj*, dj*). The first step towards this 
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is applying Lemma [5] for an upper bound. Also making use of the feasibility of the optimal 
schedule, we get: 


Ol 

WED¥{rj*) < Wopj{rj*) + -—— ■ m - {d max 
^ Z ' m * (dniax ^j*)- 

1 - a 

We can, however, also lower bound this workload. Thereto, note that job j* is not processed 
for at least (1 — a.)(dj* — rj*) time units where dj* = dmax, implying that all machines must 
be occupied by then, i.e., 


> (1 — o) ■ pm ■ {d max 

If we compare the right-hand sides of the two inequalities, we arrive at a contradiction if 
p>l/(l — a)^, which concludes the proof. □ 

B Missing Material from Section [5] 

B.l Analysis of the Job Assignment 

We show that the algorithm does not fail to assign any job to a group for some r = 
O(mlogm). The basic idea is the same as that for laminar instances, that is, we again select 
out a critical job set if the algorithm fails to assign a job, and we derive a contradiction by 
a load argument from this set. 

In particular, for a fixed job j* whose assignment fails, we iteratively define its failure 
set F C J, which is the union of the t + 1 different sets Fq,. .. ,Fr. To define Fq, we also 
use auxiliary sets Fq, ... ,Fq. 

We initialize Fq := {j*}. Given Fq, we construct Ft and Fq~^ in the following way. First, 
Fi is defined to be the set of all ^-maximal i-th candidates of jobs in Fq. Subsequently, Fg“^ 
is constructed by adding all the i-th users of jobs in Fi to Fg. Formally, 

F, := ({c.(j) I J e Fq*}) 

and Fp*"i := F} U (J Ui{j), 

j&Fi 

where is the operator that picks out the ^-maximal elements from a set of jobs: 
M^{S) := {j e S' I : j j'}. After r such iterations, that is, when Fg, Fi, ... ,Fr have 
been computed, we set Fg := M^(Fq). 

Again, we have the following lemma whose proof is exactly the same as that of Lemma[3] 
Notice that, due to the difference in the bin sizes, here Fg y-blocks each Fi where 7 = 
l/{qmT), which is different from Lemma [31 

Lemma 10. Every failure set F has the following properties: 

(i) We have Fq ^ ^ F^. 

(ii) The sets Fq, ..., Ft are pairwise disjoint. 

(Hi) For all i = 1,... ,T, Fq ^-blocks Fi where 7 = I/(gmr). 

The following lemma is also true via the same proof as Lemma jT] 
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Lemma 11. For any fixed a G (0,1), there exists k = 0{m) with the following property. If 
Fi ^-blocks Fh, then Fi+k 2"f-blocks Fh, for all i + k < h. 

Lemma 12. There exists g = 0(mf\ogm) such that the job assignment never fails. 

Proof. We will choose r = O{m\ogm) such that we get a contradiction to the existence of 
the failure set F. Using Lemma fTOl fiiil and Lemma El we can choose A = 0{m\og{qmT)) 
such that Fx 2-blocks every Fi with i > X. Now we lower bound the total contribution of 
these Fi to I{Fx): For some j G Fi where i > X, its contribution to I{Fx) is 


\I{Fx)nl{j)\-£,> 


\iiFx)ni{j)\ 


since \I{Fx) Ci I{j)\ > 2£j. Thus we get as total contribution of Fi to Fx at least 


E 

j€Fi 


\i{Fx)nm ^ 1 
2 - 2 


liFx) n ( IJ /(j)) 

jeFk 

\I{Fx)nl{F,)\ ^ \I{Fx)\ 


where the last inequality follows from Fx -< Fi. Therefore, the total contribution of all Fi 
for i > A is at least (r — A)|/(Fa)|/ 2, which is at most m\I{Fx)\ by Theorem[TJ Thus, we 
must have r < A + 2m = 0{mlog{qmT)), and appropriately choosing r = C>(mlogm) yields 
a contradiction. By LemmalU we can choose g = 0{mT) = 0{mf logm). □ 

We now obtain Theorem [6] by putting Lemmas [7l and IT^ together. 
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